Multiple-UAV Reinforcement Learning Algorithm Based on Improved PPO in Ray Framework
نویسندگان
چکیده
Distributed multi-agent collaborative decision-making technology is the key to general artificial intelligence. This paper takes self-developed Unity3D combat environment as test scenario, setting a task that requires heterogeneous unmanned aerial vehicles (UAVs) perform distributed and complete cooperation task. Aiming at problem of traditional proximal policy optimization (PPO) algorithm’s poor performance in field complex collaboration scenarios based on training framework Ray, Critic network PPO algorithm improved learn centralized value function, muti-agent (MAPPO) proposed. At same time, inheritance method course learning adopted improve generalization algorithm. In experiment, MAPPO can obtain highest average accumulate reward compared with other algorithms goal fewest steps after convergence, which fully demonstrates outperforms state-of-the-art.
منابع مشابه
Operation Scheduling of MGs Based on Deep Reinforcement Learning Algorithm
: In this paper, the operation scheduling of Microgrids (MGs), including Distributed Energy Resources (DERs) and Energy Storage Systems (ESSs), is proposed using a Deep Reinforcement Learning (DRL) based approach. Due to the dynamic characteristic of the problem, it firstly is formulated as a Markov Decision Process (MDP). Next, Deep Deterministic Policy Gradient (DDPG) algorithm is presented t...
متن کاملDynamic Obstacle Avoidance by Distributed Algorithm based on Reinforcement Learning (RESEARCH NOTE)
In this paper we focus on the application of reinforcement learning to obstacle avoidance in dynamic Environments in wireless sensor networks. A distributed algorithm based on reinforcement learning is developed for sensor networks to guide mobile robot through the dynamic obstacles. The sensor network models the danger of the area under coverage as obstacles, and has the property of adoption o...
متن کاملWeb pages ranking algorithm based on reinforcement learning and user feedback
The main challenge of a search engine is ranking web documents to provide the best response to a user`s query. Despite the huge number of the extracted results for user`s query, only a small number of the first results are examined by users; therefore, the insertion of the related results in the first ranks is of great importance. In this paper, a ranking algorithm based on the reinforcement le...
متن کاملMultiple Model-Based Reinforcement Learning
We propose a modular reinforcement learning architecture for nonlinear, nonstationary control tasks, which we call multiple model-based reinforcement learning (MMRL). The basic idea is to decompose a complex task into multiple domains in space and time based on the predictability of the environmental dynamics. The system is composed of multiple modules, each of which consists of a state predict...
متن کاملAutonomous UAV Navigation Using Reinforcement Learning
Unmanned aerial vehicles (UAV) are commonly used for missions in unknown environments, where an exact mathematical model of the environment may not be available. This paper provides a framework for using reinforcement learning to allow the UAV to navigate successfully in such environments. We conducted our simulation and real implementation to show how the UAVs can successfully learn to navigat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Drones
سال: 2022
ISSN: ['2504-446X']
DOI: https://doi.org/10.3390/drones6070166